A codebook adaptation algorithm for SCHMM using formant distribution

نویسندگان

  • Tae-Young Yang
  • Won-Ho Shin
  • Weon-Goo Kim
  • Dae Hee Youn
چکیده

This paper describes a codebook adaptation process improving the performance of speaker adaptation. The proposed method is performed prior to Bayesian speaker adaptation method using the formant distribution of adaptation data. The reference codebook is adapted to represent the formant distribution of a new speaker. The average recognition rate of Bayesian adaptation is improved from 91.4% to 95.1% using the proposed method. The proposed method is e ective particularly when there exists a large mismatch between the reference codebook and a target speaker in feature space. In this cases the average recognition rate is 95.0% while 89.9% is obtained when only Bayesian adaptation is performed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

COMPARISON OF A NEW HYBRID CONNECTIONIST-SCHMM APPROACH WITH OTHER HYBRID APPROACHES FOR SPEECH RECO - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

This paper compares a newly proposed hybrid connectionist-SCHMM approach [5] with other hybrid a p proaches. In the new approach a multilayer perceptron (MLP) replaces the conventional codebooks of semicontinuous HMMs. The MLP is therefore trained on s w d k d basic elements (phones and phone parts) in such a way that the outputs of the network estimate the a posteriori probabilities of these e...

متن کامل

Title On-line adaptation of the SCHMM parameters based on the segmental quasi-bayes learning for speech recognition

In this correspondence, on-line quasi-Bayes adaptation of the mixture coefficients and mean vectors in semicontinuous hidden Markov model (SCHMM) is studied. The viability of the proposed algorithm is confirmed and the related practical issues are addressed in a specific application of on-line speaker adaptation using a 26-word English alphabet vocabulary.

متن کامل

An expectation maximization approach for formant tracking using a parameter-free non-linear predictor

This paper presents a new approach for formant tracking using a parameter-free non-linear predictor that maps formant frequencies and bandwidths into the acoustic feature space. The approach relies on decomposing the speech signal into two components: the first component captures the mapping between formants and acoustic observations, while the second component is intended to capture the residu...

متن کامل

On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition

In this correspondence, on-line quasi-Bayes adaptation of the mixture coefficients and mean vectors in semicontinuous hidden Markov model (SCHMM) is studied. The viability of the proposed algorithm is confirmed and the related practical issues are addressed in a specific application of on-line speaker adaptation using a 26-word English alphabet vocabulary.

متن کامل

Phoneme-Dependent Speech Enhancement

The majority of current speech enhancement systems are based on generalized signal-to-noise ratio dependent weighting rules and do not take into account the characteristics of the actual speech sound being processed. The following contribution is concerned with phoneme-specific speech enhancement methods that apply specially tailored signal processing methods. The first signal processing algori...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996